Prioritized Goal Decomposition of Markov Decision Processes: Toward a Synthesis of Classical and Decision Theoretic Planning
نویسندگان
چکیده
We describe an approach to goal decomposition for a certain class of Markov decision processes (MDPs). An abstraction mechanism is used to generate abstract MDPs associated with different objectives, and several methods for merging the policies for these different objectives are considered. In one technique, causal (least-commitment) structures are generated for abstract policies and plan merging techniques, exploiting the relaxation of policy commitments reflected in this structure, are used to piece the results into a single policy. Abstract value functions provide guidance if plan repair is needed. This work makes some first steps toward the synthesis of classical and decision theoretic planning methods.
منابع مشابه
Accelerated decomposition techniques for large discounted Markov decision processes
Many hierarchical techniques to solve large Markov decision processes (MDPs) are based on the partition of the state space into strongly connected components (SCCs) that can be classified into some levels. In each level, smaller problems named restricted MDPs are solved, and then these partial solutions are combined to obtain the global solution. In this paper, we first propose a novel algorith...
متن کاملDecision - Theoretic Planning : Structural Assumptions andComputational
Planning under uncertainty is a central problem in the study of automated sequential decision making, and has been addressed by researchers in many diierent elds, including AI planning, decision analysis, operations research, control theory and economics. While the assumptions and perspectives adopted in these areas often diier in substantial ways, many planning problems of interest to research...
متن کاملScaling Decision Theoretic Planning
As classical planning branches out to consider richer models, many extensions approach decision theoretic models. Decision theory research uses models like MDPs and POMDPs which are very expressive, but can be difficult to scale. Whereas, planning research concentrates quite a bit on scalability. Our previous work and future doctoral thesis concentrates on extending the planning model toward pr...
متن کاملExploiting Domain Structure in Multiagent Decision-Theoretic Planning and Reasoning
EXPLOITING DOMAIN STRUCTURE IN MULTIAGENT DECISION-THEORETIC PLANNING AND REASONING MAY 2013 AKSHAT KUMAR B.Tech., INDIAN INSTITUTE OF TECHNOLOGY GUWAHATI M.Sc., UNIVERSITY OF MASSACHUSETTS AMHERST Ph.D., UNIVERSITY OF MASSACHUSETTS AMHERST Directed by: Professor Shlomo Zilberstein This thesis focuses on decision-theoretic reasoning and planning problems that arise when a group of collaborative...
متن کاملPlanning and Programming with First-Order Markov Decision Processes: Insights and Challenges
Markov decision processes (MDPs) have become the de facto standard model for decision-theoretic planning problems. However, classic dynamic programming algorithms for MDPs [22] require explicit state and action enumeration. For example, the classical representation of a value function is a table or vector associating a value with each system state; such value functions are produced by iterating...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997